PCT

WORLD INTELLECTUAL PROPERTY ORGANIZATION
International Bureau

INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT)

(51) International Patent Classification:
C12N 15/12, C07K 13/00
C12N 5/10, G01N 33/74

A1

(11) International Publication Number: WO 93/19175

(43) International Publication Date: 30 September 1993 (30.09.93)



(21) International Application Number: PCT/EP93/00697 

(22) International Filing Date: 23 March 1993 (23.03.93) 



(30) Priority data:
398/92     25 March 1992 (25.03.92)     DK



(71)(72) Applicant and Inventor: THORENS, Bernard [CH/CH]; 70,
Grand-Chemin, CH-1066 Epalinges (CH).

(74) Agent: NOVO NORDISK A/S; Patent Department, Novo Allé,
DK-2880 Bagsvaerd (DK).



(81) Designated States: AU, BB, BG, BR, CA, CZ, FI, HU, JP,
KP, KR, KZ, LK, MG, MN, MW, NO, NZ, PL, RO,
RU, SD, SK, UA, US, VN, European patent (AT, BE,
CH, DE, DK, ES, FR, GB, GR, IE, IT, LU, MC, NL,
PT, SE), OAPI patent (BF, BJ, CF, CG, CI, CM, GA,
GN, ML, MR, NE, SN, TD, TG).



Published

With international search report.
Before the expiration of the time limit for amending the
claims and to be republished in the event of the receipt of
amendments.



(54) Title: RECEPTOR FOR THE GLUCAGON-LIKE-PEPTIDE-1 (GLP-1)




(57) Abstract

[Figure: binding curve; x-axis: GLP-1 (nM)]

The present invention relates to a recombinant glucagon-like peptide-1 (GLP-1) receptor, to a DNA construct which
comprises a DNA sequence encoding a GLP-1 receptor, to methods of screening for agonists of GLP-1 activity, and to the use of the
GLP-1 receptor for screening for agonists of GLP-1 activity.



FOR THE PURPOSES OF INFORMATION ONLY

Codes used to identify States party to the PCT on the front pages of pamphlets publishing international
applications under the PCT.

AT  Austria                    FR  France                                  MR  Mauritania
AU  Australia                  GA  Gabon                                   MW  Malawi
BB  Barbados                   GB  United Kingdom                          NL  Netherlands
BE  Belgium                    GN  Guinea                                  NO  Norway
BF  Burkina Faso               GR  Greece                                  NZ  New Zealand
BG  Bulgaria                   HU  Hungary                                 PL  Poland
BJ  Benin                      IE  Ireland                                 PT  Portugal
BR  Brazil                     IT  Italy                                   RO  Romania
CA  Canada                     JP  Japan                                   RU  Russian Federation
CF  Central African Republic   KP  Democratic People's Republic of Korea   SD  Sudan
CG  Congo                      KR  Republic of Korea                       SE  Sweden
CH  Switzerland                KZ  Kazakhstan                              SK  Slovak Republic
CI  Côte d'Ivoire              LI  Liechtenstein                           SN  Senegal
CM  Cameroon                   LK  Sri Lanka                               SU  Soviet Union
CS  Czechoslovakia             LU  Luxembourg                              TD  Chad
CZ  Czech Republic             MC  Monaco                                  TG  Togo
DE  Germany                    MG  Madagascar                              UA  Ukraine
DK  Denmark                    ML  Mali                                    US  United States of America
ES  Spain                      MN  Mongolia                                VN  Viet Nam
FI  Finland








RECEPTOR FOR THE GLUCAGON-LIKE-PEPTIDE-1 (GLP-1)

FIELD OF THE INVENTION

The present invention relates to a recombinant glucagon-like
peptide-1 (GLP-1) receptor, to a DNA construct which comprises
a DNA sequence encoding a GLP-1 receptor, to methods of
screening for agonists of GLP-1 activity, and to the use of the
GLP-1 receptor for screening for agonists of GLP-1 activity.



BACKGROUND OF THE INVENTION

As used in the present specification the designation GLP-1
comprises GLP-1(7-37) as well as GLP-1(7-36)amide.

Glucose-induced insulin secretion is modulated by a number of
hormones and neurotransmitters. In particular, two gut
hormones, glucagon-like peptide-1 (GLP-1) and gastric
inhibitory peptide (GIP), potentiate the effect of glucose on
insulin secretion and are thus called gluco-incretins (Dupre,
in The Endocrine Pancreas, E. Samois Ed. (Raven Press, New
York, (1991), 253 - 281) and Ebert and Creutzfeld, (Diabetes
Metab. Rev. 3, (1987)). Glucagon-like peptide-1 is a
gluco-incretin both in rat and in man (Dupre and Ebert and
Creutzfeld, vide supra, and Kreymann et al. (Lancet 2 (1987),
1300)). It is part of the preproglucagon molecule (Bell et al.
Nature 304 (1983), 368) which is proteolytically processed in
intestinal L cells to GLP-1(1-37) and GLP-1(7-36)amide or
GLP-1(7-37) (Mojsov et al. (J. Biol. Chem. 261 (1986), 11880) and
Habener et al.: The Endocrine Pancreas, E. Samois Ed. (Raven
Press, New York (1991), 53 - 71)). Only the truncated forms of
GLP-1 are biologically active and both have identical effects
on insulin secretion in beta cells (Mojsov et al. J. Clin. Invest
29 (1987), 616 and Weir et al. Diabetes 38 (1989), 338). They
are the most potent gluco-incretins so far described and are
active at concentrations as low as one to ten picomolar. The
stimulatory effect of these gluco-incretin hormones requires
the presence of glucose at or above the normal physiological
concentration of about 5 mM and is mediated by activation of
adenylate cyclase and a rise in the intracellular concentration
of cyclic AMP (Drucker et al. Proc. Natl. Acad. Sci. USA 84
(1987), 3434 and Goke et al. Am. J. Physiol. 257 (1989), G397).
GLP-1 also has a stimulatory effect on insulin gene
transcription (Drucker et al. Proc. Natl. Acad. Sci. USA 84
(1987), 3434). In a rat model, non-insulin-dependent diabetes
mellitus (NIDDM) is associated with a reduced stimulatory
effect of GLP-1 on glucose-induced insulin secretion (Suzuki et
al. Diabetes 19 (1990), 1320). In man, in one study, GLP-1
levels were elevated in NIDDM patients both in the basal state
and after glucose ingestion; however, following a glucose load
there was only a very small rise in plasma insulin
concentration (Ørskov et al. J. Clin. Invest. 87 (1991), 415).
A recent study (Nathan et al. Diabetes Care 15 (1992), 270)
showed that GLP-1 infusion could ameliorate postprandial
insulin secretion and glucose disposal in NIDDM patients. Thus,
as a further step in understanding the complex modulation of
insulin secretion by gut hormones and its dysfunction in
diabetes, we isolated and characterized a complementary DNA for
the beta cell GLP-1 receptor and showed that it is part of a
new family of G-coupled receptors.

DESCRIPTION OF THE INVENTION

The present invention relates to a recombinant glucagon-like
peptide-1 (GLP-1) receptor.

More preferably, the invention relates to a GLP-1 receptor
which comprises the amino acid sequence shown in SEQ ID No. 1,
or an analogue thereof binding GLP-1 with an affinity constant,
KD, below 100 nM, preferably below 10 nM. In the present
context, the term "analogue" is intended to indicate a
naturally occurring variant (including one expressed in other
animal species, in particular human) of the receptor or a
"derivative", i.e. a polypeptide which is derived from the
native GLP-1 receptor by suitably modifying the DNA sequence
coding for the variant, resulting in the addition of one or
more amino acids at either or both the C- and N-terminal ends
of the native amino acid sequence, substitution of one or more
amino acids at one or more sites in the native amino acid
sequence, deletion of one or more amino acids at either or both
ends of the native sequence or at one or more sites within the
native sequence, or insertion of one or more amino acids in the
native sequence.

In another aspect, the present invention relates to a DNA
construct which comprises a DNA sequence encoding the GLP-1
receptor of the invention, as well as a recombinant expression
vector carrying the DNA construct and a cell containing said
recombinant expression vector.

In one embodiment of the invention, the GLP-1 receptor molecule
may be provided in solubilised and/or reconstituted form.

In the present context "solubilised" is intended to indicate a
receptor as present in detergent-solubilised membrane
preparations. "Reconstituted" is intended to indicate a
receptor solubilised in the presence of essential cofactors,
e.g. G-protein. In this embodiment the receptor may be in a
reconstituted micellar form.

The DNA construct of the invention encoding the GLP-1 receptor
preferably comprises the DNA sequence shown in SEQ ID No. 1, or
at least a DNA sequence coding for a functional analogue
thereof binding GLP-1 with an affinity below 100 nM, preferably
below 10 nM, or a suitable modification thereof. Examples of
suitable modifications of the DNA sequence are nucleotide
substitutions which do not give rise to another amino acid
sequence of the GLP-1 receptor, but which may correspond to the
codon usage of the host organism into which the DNA construct
is introduced, or nucleotide substitutions which do give rise to
a different amino acid sequence and therefore, possibly, a
different protein structure without, however, impairing the
properties of the native variant. Other examples of possible
modifications are insertion of one or several nucleotides into
the sequence, addition of one or several nucleotides at either
end of the sequence, or deletion of one or several nucleotides
at either end or within the sequence.
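
The distinction drawn above between substitutions that leave the encoded amino acid sequence unchanged and those that alter it can be illustrated with a minimal Python sketch. The short coding sequences used here are hypothetical and are not taken from SEQ ID No. 1; the function names `translate` and `is_silent` are illustrative only.

```python
# Standard genetic code, built from the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna):
    """Translate a coding sequence into one-letter amino acids ('*' = stop).

    Any trailing bases that do not form a full codon are ignored.
    """
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def is_silent(original, modified):
    """Return True if both sequences encode the same amino acid sequence."""
    return translate(original) == translate(modified)


# Hypothetical 12-nucleotide fragment and two modified versions.
native = "ATGGCTCTGAAA"          # Met-Ala-Leu-Lys
codon_adapted = "ATGGCCCTCAAG"   # synonymous codons only
substituted = "ATGGCTCCGAAA"     # Leu codon changed to a Pro codon

print(translate(native), is_silent(native, codon_adapted))
print(translate(substituted), is_silent(native, substituted))
```

A check of this kind only confirms that the protein sequence is unchanged; whether a substitution that does change the sequence impairs receptor properties has to be established experimentally, for example by the binding criteria given above.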

Another example of a DNA construct of the invention is one 
which encodes a GLP-1 receptor variant particularly suitable 
for solubilisation and reconstitution. 

The DNA construct of the invention encoding the present GLP-1
receptor may be prepared synthetically by established standard
methods, e.g. the phosphoramidite method described by Beaucage
and Caruthers, Tetrahedron Letters 22 (1981), 1859 - 1869, or
the method described by Matthes et al., EMBO Journal 2 (1984),
801 - 805. According to the phosphoramidite method,
oligonucleotides are synthesized, e.g. in an automatic DNA
synthesizer, purified, annealed, ligated and cloned in suitable
vectors.

The DNA construct of the invention may also be of genomic or
cDNA origin, for instance obtained by preparing a genomic or
cDNA library and screening for DNA sequences coding for all or
part of the GLP-1 receptor of the invention by hybridization
using synthetic oligonucleotide probes in accordance with
standard techniques (cf. Sambrook et al., Molecular Cloning: A
Laboratory Manual, 2nd Ed., Cold Spring Harbor, 1989). In this
case, a genomic or cDNA sequence encoding the GLP-1 receptor
may be modified at a site corresponding to the site(s) at which
it is desired to introduce amino acid substitutions, e.g. by
site-directed mutagenesis using synthetic oligonucleotides
encoding the desired amino acid sequence for homologous
recombination in accordance with well-known procedures.

Finally, the DNA construct may be of mixed synthetic and
genomic, mixed synthetic and cDNA or mixed genomic and cDNA
origin prepared by ligating fragments of synthetic, genomic or
cDNA origin (as appropriate), the fragments corresponding to
various parts of the entire DNA construct, in accordance with
standard techniques. The DNA construct may also be prepared by
polymerase chain reaction using specific primers, for instance
as described in US 4,683,202 or Saiki et al., Science 239
(1988), 487 - 491.

The recombinant expression vector into which the DNA construct
of the invention is inserted may be any vector which may
conveniently be subjected to recombinant DNA procedures, and
the choice of vector will often depend on the host cell into
which it is to be introduced. Thus, the vector may be an
autonomously replicating vector, i.e. a vector which exists as
an extrachromosomal entity, the replication of which is
independent of chromosomal replication, e.g. a plasmid.
Alternatively, the vector may be one which, when introduced
into a host cell, is integrated into the host cell genome and
replicated together with the chromosome(s) into which it has
been integrated.

In the vector, the DNA sequence encoding the GLP-1 receptor of
the invention should be operably connected to a suitable
promoter sequence. The promoter may be any DNA sequence which
shows transcriptional activity in the host cell of choice and
may be derived from genes encoding proteins either homologous
or heterologous to the host cell. Examples of suitable
promoters for directing the transcription of the DNA encoding the
GLP-1 receptor of the invention in mammalian cells are the SV40
promoter (Subramani et al., Mol. Cell Biol. 1 (1981), 854 -
864), the metallothionein gene promoter (Palmiter et
al., Science 222 (1983), 809 - 814) or the adenovirus 2 major
late promoter. A suitable promoter for use in insect cells is
the polyhedrin promoter (Vasuvedan et al., FEBS Lett. 311
(1992), 7 - 11). Suitable promoters for use in yeast host cells
include promoters from yeast glycolytic genes (Hitzeman et al.,
J. Biol. Chem. 255 (1980), 12073 - 12080; Alber and Kawasaki,
J. Mol. Appl. Gen. 1 (1982), 419 - 434) or alcohol
dehydrogenase genes (Young et al., in Genetic Engineering of
Microorganisms for Chemicals (Hollaender et al., eds.), Plenum
Press, New York, 1982), or the TPI1 (US 4,599,311) or ADH2-4c
(Russell et al., Nature 304 (1983), 652 - 654) promoters.
Suitable promoters for use in filamentous fungus host cells
are, for instance, the ADH3 promoter (McKnight et al., The EMBO
J. 4 (1985), 2093 - 2099) or the tpiA promoter.

The DNA sequence encoding the GLP-1 receptor of the invention
may also be operably connected to a suitable terminator, such
as the human growth hormone terminator (Palmiter et al., op.
cit.) or (for fungal hosts) the TPI1 (Alber and Kawasaki, op.
cit.) or ADH3 (McKnight et al., op. cit.) terminators. The
vector may further comprise elements such as polyadenylation
signals (e.g. from SV40 or the adenovirus 5 E1b region),
transcriptional enhancer sequences (e.g. the SV40 enhancer) and
translational enhancer sequences (e.g. the ones encoding
adenovirus VA RNAs).

The recombinant expression vector of the invention may further
comprise a DNA sequence enabling the vector to replicate in the
host cell in question. An example of such a sequence (when the
host cell is a mammalian cell) is the SV40 origin of
replication. The vector may also comprise a selectable marker,
e.g. a gene the product of which complements a defect in the
host cell, such as the gene coding for dihydrofolate reductase
(DHFR) or one which confers resistance to a drug, e.g.
neomycin, hygromycin or methotrexate.

The procedures used to ligate the DNA sequences coding for the
GLP-1 receptor of the invention, the promoter and the
terminator, respectively, and to insert them into suitable vectors
containing the information necessary for replication, are well
known to persons skilled in the art (cf., for instance,
Sambrook et al., op. cit.).

The host cell into which the expression vector of the invention
is introduced may be any cell which is capable of producing the
GLP-1 receptor of the invention and is preferably a eukaryotic
cell, such as invertebrate (insect) cells or vertebrate cells,
e.g. Xenopus laevis oocytes or mammalian cells, in particular
insect and mammalian cells. Examples of suitable mammalian cell
lines are the COS (ATCC CRL 1650), BHK (ATCC CRL 1632, ATCC CCL
10), CHL (ATCC CCL39) or CHO (ATCC CCL 61) cell lines. Methods
of transfecting mammalian cells and expressing DNA sequences
introduced in the cells are described in e.g. Kaufman and
Sharp, J. Mol. Biol. 159 (1982), 601 - 621; Southern and Berg,
J. Mol. Appl. Genet. 1 (1982), 327 - 341; Loyter et al., Proc.
Natl. Acad. Sci. USA 79 (1982), 422 - 426; Wigler et al., Cell
14 (1978), 725; Corsaro and Pearson, Somatic Cell Genetics 7
(1981), 603; Graham and van der Eb, Virology 52 (1973), 456;
and Neumann et al., EMBO J. 1 (1982), 841 - 845.

Alternatively, fungal cells (including yeast cells) may be used
as host cells of the invention. Examples of suitable yeast
cells include cells of Saccharomyces spp. or
Schizosaccharomyces spp., in particular strains of Saccharomyces
cerevisiae. Examples of other fungal cells are cells of
filamentous fungi, e.g. Aspergillus spp. or Neurospora spp., in
particular strains of Aspergillus oryzae or Aspergillus niger.
The use of Aspergillus spp. for the expression of proteins is
described in, e.g., EP 272 277.

The GLP-1 receptor according to the invention may be produced
by a method which comprises culturing a cell as described above
in a suitable nutrient medium under conditions which are
conducive to the expression of the GLP-1 receptor, and
recovering the GLP-1 receptor from the culture. The medium used
to culture the cells may be any conventional medium suitable
for growing mammalian cells, such as a serum-containing or
serum-free medium containing appropriate supplements. Suitable
media are available from commercial suppliers or may be
prepared according to published recipes (e.g. in catalogues of
the American Type Culture Collection).

If the GLP-1 receptor has retained the transmembrane and
(possibly) the cytoplasmic region of the native variant, it will be
anchored in the membrane of the host cell, and the cells
carrying the GLP-1 receptor may be used as such in the
screening or diagnostic assay. Alternatively, the receptor may
be a component of membrane preparations, e.g. in solubilised
and/or reconstituted form as defined above.

In a still further aspect, the present invention relates to a
method of screening for agonists or enhancers of GLP-1
activity, the method comprising incubating a GLP-1 receptor
according to any of claims 1-3 with a substance suspected to
be an agonist of GLP-1 activity and subsequently with GLP-1
or an analogue thereof, and detecting any effect from the
suspected agonist on the binding of GLP-1 to the GLP-1
receptor. An enhancer is defined as a compound capable of
stabilizing interaction between a high-affinity form of the
receptor and the corresponding ligand, as described e.g. for
the adenosine receptor (Bruns et al. Molecular Pharmacology 38
(1990), 939).

An alternative method of screening for agonists of GLP-1
activity comprises incubating GLP-1 or an analogue thereof
with a substance suspected to be an agonist of GLP-1 activity
and subsequently with a GLP-1 receptor of the invention, and
detecting any effect on the binding to the GLP-1 receptor. Such
agonists of GLP-1 activity will be substances stimulating
glucose-induced insulin secretion and may be used in the
treatment of NIDDM.

The GLP-1 receptor may be immobilized on a solid support and
may, as such, be used as a reagent in the screening methods of
the invention. The GLP-1 receptor may be used in
membrane-bound form, i.e. bound to whole cells or as a component of
membrane preparations immobilised on a solid support.

The solid support employed in the screening methods of the
invention preferably comprises a polymer. The support may in
itself be composed of the polymer or may be composed of a
matrix coated with the polymer. The matrix may be of any
suitable material such as glass, paper or plastic. The polymer
may be selected from the group consisting of a plastic (e.g.
latex, a polystyrene, polyvinyl chloride, polyurethane,
polyacrylamide, polyvinylalcohol, nylon, polyvinylacetate, and
any suitable copolymer thereof), cellulose (e.g. various types
of paper, such as nitrocellulose paper and the like), a silicon
polymer (e.g. siloxane), a polysaccharide (e.g. agarose or
dextran), an ion exchange resin (e.g. conventional anion or
cation exchange resins), a polypeptide such as polylysine, or
a ceramic material such as glass (e.g. controlled pore glass).

The physical shape of the solid support is not critical,
although some shapes may be more convenient than others for the
present purpose. Thus, the solid support may be in the shape of
a plate, e.g. a thin layer or microtiter plate, or a film,
strip, membrane (e.g. a nylon membrane or a cellulose filter)
or solid particles (e.g. latex beads or dextran or agarose
beads). In a preferred embodiment, the solid support is in the
form of wheat germ agglutinin-coated SPA beads (cf. US
4,568,649).

Alternatively, screening for GLP-1 agonists can also be carried
out using a cell line expressing the cloned GLP-1 receptor
functionally coupled to a G-protein. In living cells, exposure
to an agonist will give rise to an increase in the
intracellular cAMP concentration. The cAMP concentration can
then be measured directly. Changes in cAMP levels may also be
monitored indirectly using appropriate cell lines in which a
measurable signal is generated in response to an increase in
intracellular cAMP.

It is furthermore contemplated to locate the ligand-binding
site on the GLP-1 receptor of the invention, for instance by
preparing deletion or substitution derivatives of the native
GLP-1 receptor (as described above) and incubating these with
ligands known to bind the full-length GLP-1 receptor and
detecting any binding of the ligand to the GLP-1 receptor
deletion derivative. Once the ligand-binding site has been
located, this may be used to acquire further information about
the three-dimensional structure of the ligand-binding site.
Such three-dimensional structures may, for instance, be
established by means of protein engineering, computer
modelling, NMR technology and/or crystallographic techniques.
Based on the three-dimensional structure of the ligand-binding
site, it may be possible to design substances which are
agonists to the GLP-1 molecule.

The characterization of the GLP-1 receptor is of considerable
physiological and pathological importance. It will help study
a fundamental aspect of the entero-insular axis (Unger and
Eisentraut, Arch. Int. Med. 123 (1969), 261): the potentiating
effect of gut hormones on glucose-induced insulin secretion,
the role of these hormones in the control of glucose
homeostasis and also the possible therapeutic use of GLP-1 to
stimulate insulin secretion in NIDDM patients (Nathan et al.
Diabetes Care 15 (1992), 270). Investigation of the regulated
expression and desensitization of the receptor in the normal
state and during the development of diabetes will contribute to
a better understanding of the modulation of insulin secretion
in normal and pathological situations. Availability of
antibodies against this receptor may also allow an analysis of
the surface localization of this receptor and its distribution
relative to the beta cell glucose transporter GLUT2 (Thorens et
al. Cell 55 (1988), 281 and Orci et al. Science 245 (1989),
295). This aspect pertains to the hypothesis that the beta cell
membrane has a "regulatory" domain which contains hormone
receptors (Bonner-Weir Diabetes 37 (1988), 616), and which may
be distinct from GLUT2-containing membrane domains previously
identified (Thorens et al. Cell 55 (1988), 281 and Orci et al.
Science 245 (1989), 295). Finally, the identification of an
additional member of this new family of G-coupled receptors
will help design experiments to probe the structure-function
relationship of these new molecules.

BRIEF DESCRIPTION OF THE DRAWINGS

The present invention is further illustrated in the following
examples with reference to the appended drawings in which

Fig. 1A and Fig. 1B, which is a continuation of Fig. 1A, together
show the amino acid sequence of the rat GLP-1 receptor in a
comparison with the sequence of the rat secretin receptor
(SECR), the opossum parathyroid hormone receptor (PTHR) and
the porcine calcitonin receptor (CTR1). The GLP-1 receptor has
three N-glycosylation sites in the extracellular domain
(arrows). Four cysteines are conserved at identical places in
the four receptors (boxes). Note the otherwise very divergent
sequences in this part of the molecules as well as in the
COOH-terminal cytoplasmic tail. Sequence identities are denoted by
stars and homologies by dots. The locations of the putative
transmembrane domains are indicated by horizontal bars above
the sequences.




Fig. 2 shows binding of 125I-GLP-1 to COS cells transfected
with the pGLPR-16 plasmid. Specific binding reaches saturation
at 1 to 10 nM GLP-1. Insert: Scatchard analysis of GLP-1
binding.

Fig. 3 shows binding of 125I-GLP-1 to INS-1 cells. Specific
binding reaches saturation at 1 to 10 nM GLP-1. Insert:
Scatchard analysis of GLP-1 binding.

Fitting of the curves in Figs. 2 and 3 was performed with the
LIGAND program (McPherson, Kinetic, EBDA, Ligand, Lowry. A
Collection of radioligand analysis programs (Elsevier,
Amsterdam, 1985)).

Fig. 4 shows displacement of 125I-GLP-1 binding to COS cells
transfected with the rat GLP-1 receptor cDNA. Transfected cells
were incubated with 20 pM 125I-GLP-1 in the presence of
increasing concentrations of cold peptides. Each point was
measured in duplicate and the experiments repeated three times
for GLP-1, GIP and glucagon and once for VIP and secretin.

Fig. 5 shows stimulation of cyclic AMP formation in COS cells
transfected with the rat GLP-1 receptor cDNA. COS cells were
transfected with the pcDNA-1 vector alone (open bars) or the
pGLPR-1 plasmid (striped bars) and incubated in the absence or
the presence of GLP-1 at the indicated concentration. cAMP
production was measured in triplicate with a radioimmunoassay
(Amersham).

Fig. 6 shows tissue specificity of GLP-1 receptor expression
assessed by Northern blotting of RNA from different tissues and
from the INS-1 cell line. Ten micrograms of total RNA was
analyzed on each lane. Two major RNA species of 2.7 and 3.6 kb
were detected in all tissues in which the receptor was
detected. The position of the migration of the ribosomal RNAs
is indicated to the left of the picture.

Fig. 7 is a comparison of the rat GLP-1 receptor amino acid
sequence (rat) and a partial amino acid sequence of the human
GLP-1 receptor (human).

The present invention is further illustrated in the following
examples, which are not intended to be in any way limiting to the
scope of the invention as claimed.

EXAMPLE 1

Molecular Cloning and Characterisation of the Rat Islet
Receptor cDNA.

A rat pancreatic islet cDNA library was constructed in the
pcDNA-1 expression vector. Rat pancreatic islets were prepared
according to Gotoh et al. (Transplantation 43 (1985), 725).
PolyA+ RNA was prepared and the cDNA library was constructed in
the pcDNA-1 vector (Invitrogen) as described by Aruffo and
Seed (Proc. Natl. Acad. Sci. USA 84 (1987), 8573) and Lin et al.
(Proc. Natl. Acad. Sci. USA 88 (1991), 3185). Plasmid DNA was
prepared from pools of five to eight thousand bacterial clones
(Maniatis et al., Molecular Cloning. A Laboratory Manual. Cold
Spring Harbor Laboratory, 1982) and transfected into COS cells
(Sompayrac and Dana, Proc. Natl. Acad. Sci. USA 28 (1981), 7575).
The presence of GLP-1 receptor expressed in COS cells was
assessed by binding of the radioiodinated peptide followed by
photographic emulsion autoradiography and screening by dark
field microscopy (Gearing et al. EMBO J. 8 (1989), 3667).
GLP-1(7-36)amide, as well as the other peptides, were purchased
from Peninsula Laboratories. Iodination was performed by the
iodine monochloride method (Contreras et al. Meth. Enzymol. 92
(1983), 277), the peptide was purified by passage over Sephadex
G-10 followed by CM-Sepharose, and specific activity was
determined by the self displacement technique (Calvo et al.
Biochem. 212 (1983), 259). A 1.6 kb cDNA clone (pGLPR-1) was
isolated by subfractionation of an original positive pool and
was used to isolate, by DNA hybridization screening, two
additional clones from primary positive pools. These plasmids
(pGLPR-16 and -87) had inserts of 3.0 and 2.0 kb, respectively.
Transfection of these clones into COS cells generated high
affinity (KD = 0.6 nM) binding sites for GLP-1 (Fig. 2). This
affinity is comparable to that seen for binding of GLP-1 to the
rat insulinoma cell line INS-1 (Asfari et al. Endocrinology 130
(1992), 167) (KD = 0.12 nM; Fig. 3). In both cases a single
high affinity binding component was detected. The binding to
GLP-1 receptor transfected COS cells reached a plateau between
1 and 10 nM. At concentrations above 10 nM a second, high
capacity, low affinity, binding component was detected.
Although specifically displaceable by cold GLP-1, this binding
was also present in COS cells transfected with the expression
vector alone and was therefore not further characterized.
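
The relationship between a single high-affinity binding component, its KD and a Scatchard plot can be sketched in Python as follows. This is a minimal illustration assuming an ideal one-site model with noise-free data; it is not the LIGAND program, and the Bmax value of 100 (arbitrary units) is a hypothetical figure chosen only for the example. Only the KD of 0.6 nM is taken from the text.

```python
def specific_binding(free_nM, bmax, kd_nM):
    """One-site saturation binding: bound = Bmax * L / (KD + L)."""
    return bmax * free_nM / (kd_nM + free_nM)


def scatchard_fit(free_nM, bound):
    """Least-squares line through (bound, bound/free).

    For a single class of sites the Scatchard plot is linear with
    slope -1/KD and x-intercept Bmax. Returns (KD, Bmax).
    """
    xs = list(bound)
    ys = [b / f for b, f in zip(bound, free_nM)]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    kd = -1.0 / slope
    bmax = intercept * kd
    return kd, bmax


# Simulated ideal data: KD from the text, hypothetical Bmax.
free = [0.05, 0.1, 0.5, 1.0, 5.0, 10.0]            # nM free ligand
bound = [specific_binding(f, 100.0, 0.6) for f in free]

kd_est, bmax_est = scatchard_fit(free, bound)
print(round(kd_est, 3), round(bmax_est, 1))
print(round(specific_binding(10.0, 1.0, 0.6), 3))  # fractional occupancy at 10 nM
```

Under this model, a KD of 0.6 nM gives a fractional occupancy of about 0.63 at 1 nM and about 0.94 at 10 nM, which is in line with the plateau between 1 and 10 nM described above. A second, low affinity component such as the one seen above 10 nM would appear as curvature in the Scatchard plot and is not handled by this single-site sketch.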

Binding of GLP-1 to the receptor expressed in COS cells was
displaced by cold GLP-1 with a 50 percent displacement achieved
at 0.5 to 1 nM (Fig. 4). Other peptide hormones of related
structure such as secretin, gastric inhibitory peptide (GIP)
and vasoactive intestinal peptide (VIP) (Dupre in The Endocrine
Pancreas, E. Samois Ed. (Raven Press, New York, (1991), 253 -
281) and Ebert and Creutzfeld, Diabetes Metab. Rev. 3, (1987))
did not displace binding. Glucagon could displace the binding
by 50 percent but only at a concentration of one micromolar
(Fig. 4). The addition of subnanomolar concentrations of GLP-1
to transfected COS cells stimulated the production of cyclic
AMP, indicating that the receptor was functionally coupled to
activation of adenylate cyclase (Fig. 5).
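
The 50 percent displacement values quoted above can be related to an inhibition constant with a short Python sketch. It assumes simple one-site competition and applies the Cheng-Prusoff correction, neither of which is stated in the source; the IC50 of 0.75 nM is a hypothetical midpoint of the quoted 0.5 to 1 nM range, while the 20 pM tracer concentration and the KD of 0.6 nM are taken from the text.

```python
import math


def fraction_bound(competitor_nM, ic50_nM):
    """Fraction of tracer still bound at a given cold-peptide concentration
    (one-site competition, tracer concentration held constant)."""
    return 1.0 / (1.0 + competitor_nM / ic50_nM)


def estimate_ic50(concs_nM, fractions):
    """Log-linear interpolation of the concentration giving 50 % binding.

    Returns None if the curve never crosses 50 % within the tested range.
    """
    points = list(zip(concs_nM, fractions))
    for (c_lo, f_lo), (c_hi, f_hi) in zip(points, points[1:]):
        if f_lo >= 0.5 >= f_hi and f_lo != f_hi:
            t = (f_lo - 0.5) / (f_lo - f_hi)
            log_c = math.log10(c_lo) + t * (math.log10(c_hi) - math.log10(c_lo))
            return 10 ** log_c
    return None


def cheng_prusoff_ki(ic50_nM, tracer_nM, kd_nM):
    """Ki = IC50 / (1 + [tracer] / KD)."""
    return ic50_nM / (1.0 + tracer_nM / kd_nM)


concs = [0.01, 0.1, 1.0, 10.0, 100.0]                   # nM cold peptide
glp1_curve = [fraction_bound(c, 0.75) for c in concs]   # hypothetical IC50
weak_curve = [fraction_bound(c, 1000.0) for c in concs] # IC50 of 1 micromolar

print(estimate_ic50(concs, glp1_curve))   # falls between 0.1 and 1 nM
print(estimate_ic50(concs, weak_curve))   # None: no 50 % point up to 100 nM
print(cheng_prusoff_ki(0.75, 0.02, 0.6))  # tracer at 20 pM = 0.02 nM
```

Because the tracer concentration (20 pM) is well below the KD, the correction factor is close to 1 and the IC50 is nearly equal to the Ki under these assumptions. The second curve mimics a competitor such as glucagon, which only reaches 50 percent displacement at about one micromolar.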

DNA sequence analysis of the rat GLP-1 receptor cDNA revealed
a major open reading frame coding for a 463 amino acid
polypeptide (SEQ ID No. 1). Hydropathy plot analysis indicated
the presence of an amino-terminal hydrophobic region most
probably representing a leader sequence. This hydrophobic
segment is followed by a hydrophilic domain of about 120 amino
acids which contains three N-linked glycosylation sites. Seven
hydrophobic segments are present which may form transmembrane
domains. A search for sequence identities showed the GLP-1
receptor to be homologous to the secretin receptor (Ishihara et
al. EMBO J. 10 (1991), 1635) (40 percent identity), the
parathyroid hormone receptor (Jüppner et al. Science 254
(1991), 1024) (32.4 percent identity) and the calcitonin
receptor (Lin et al. Science 254 (1991), 1022) (27.5 percent
identity) (Fig. 1). These four receptors do not share any
significant sequence homology with other known members of the
G-coupled receptor family and are characterized by a relatively
long amino terminal, probably extracellular, domain. The
sequence of the extracellular domain is unique for each
receptor, yet four cysteines are perfectly conserved (boxes in
Fig. 1). A fifth cysteine at position 126 of the GLP-1 receptor
is also conserved in the parathyroid and calcitonin receptors
and at a similar location in the secretin receptor (position
123). The highest sequence identity between the four proteins
resides in the transmembrane domains. The carboxyl terminal,
cytoplasmic, ends of each receptor are also very different.
These receptors all stimulate the production of cyclic AMP in
response to ligand binding (Ishihara et al. EMBO J. 10 (1991),
1635; Jüppner et al. Science 254 (1991), 1024; Lin et al.
Science 254 (1991), 1022) and are presumably coupled to the
cyclase via Gsα. In that respect, it is interesting to note
that a sequence motif present in the third cytoplasmic loop of
the GLP-1 receptors (RLAK, present just before the sixth
transmembrane domain) is very similar to a motif of the beta2
adrenergic receptor (KALK) present at the same location and
whose basic amino acids have been shown to be important in the
coupling of the receptor to Gsα (Okamoto et al. Cell 67 (1991),
723). Moreover, in the beta2 adrenergic receptor, this motif is
preceded by a basic amino acid located twelve amino acids
toward the amino-terminal end. This basic amino acid is also
required at this particular distance for efficient coupling to
Gsα. In the GLP-1 receptor a lysine residue is also present at
a similar location. This suggests that, despite the very low
overall sequence identity, a structural feature may have been
conserved in the third cytoplasmic loop between the two
receptors which may be required for the coupling of the
receptor to the Gsα protein.
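
The motif observation can be checked directly against the sequence
listing. The following is a minimal sketch and not part of the original
analysis: it takes residues 321 to 352 of the rat receptor (transcribed
from SEQ ID No. 2 into one-letter code), locates the RLAK motif and
reports the residue twelve positions toward the amino terminus. The
function name, the fragment boundaries, the choice of Lys and Arg as
"basic" residues, and the convention of counting the twelve positions
from the first residue of the motif are all assumptions made for
illustration.

```python
# Residues 321-352 of the rat GLP-1 receptor in one-letter code,
# transcribed from SEQ ID No. 2 (fragment boundaries chosen for illustration).
FRAGMENT_START = 321
FRAGMENT = "FLVFIRVICIVIAKLKANLMCKTDIKCRLAKS"

# Lys and Arg are treated as the basic residues here (an assumption).
BASIC_RESIDUES = {"K", "R"}


def motif_with_upstream_residue(seq, first_position, motif, offset):
    """Locate `motif` and the residue `offset` positions toward the N-terminus.

    Returns (motif_position, upstream_position, upstream_residue) in
    1-based receptor numbering, or None if the motif is absent or the
    upstream position falls outside the fragment.
    """
    index = seq.find(motif)
    if index < 0 or index < offset:
        return None
    upstream_index = index - offset
    return (first_position + index,
            first_position + upstream_index,
            seq[upstream_index])


hit = motif_with_upstream_residue(FRAGMENT, FRAGMENT_START, "RLAK", 12)
if hit is not None:
    motif_pos, upstream_pos, residue = hit
    kind = "basic" if residue in BASIC_RESIDUES else "not basic"
    print(f"RLAK starts at residue {motif_pos}; "
          f"residue {upstream_pos} is {residue} ({kind})")
```

Under these assumptions the motif starts at Arg 348 and the residue
twelve positions upstream is Lys 336, in line with the lysine mentioned
in the text.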

Determination of the tissue distribution of the GLP-1 receptor
was performed by Northern blot analysis. Northern blot analysis
was performed with 10 µg of total RNA (Chomczynski and Sacchi,
Anal. Biochem. 126 (1987), 156) denatured with glyoxal (McMaster
and Carmichael, Proc. Natl. Acad. Sci. USA 74 (1977), 4835),
separated on a 1% agarose gel and transferred to Nylon
membranes (Thomas, Proc. Natl. Acad. Sci. USA 77 (1980), 5201).
Hybridization was performed with the random primed labelled
(Feinberg and Vogelstein, Anal. Biochem. 132 (1983), 6) 1.6 kb
pGLPR-1 insert. Two mRNAs of 2.7 and 3.6 kb could be detected
in pancreatic islets as well as in rat insulinoma cell lines
(INS-1), in stomach and in lung (Fig. 6). No GLP-1 receptor
mRNA could be detected in brain, liver, thymus, muscle,
intestine and colon. The presence of the GLP-1 receptor has
been reported in stomach, where the peptide inhibits acid
secretion by parietal cells in in vivo experiments (Schjoldager
et al. Dig. Dis. Sci. 34 (1989), 703) but stimulates acid
secretion on isolated parietal glands (Schmidtler et al.
Am. J. Physiol. 260 (1991), G940). Binding sites for GLP-1 have
also been reported in lung membrane preparations (Richter et al.
FEBS Letter 1 (1990), 78) but the role of the hormone on lung
physiology is not known.

A stable cell line expressing the cloned rat GLP-1 receptor was
established by Ca-phosphate mediated transfection (Maniatis et
al., Molecular Cloning. A Laboratory Manual. Cold Spring
Harbor Laboratory, 1989) of the CHL cell line (ATCC CCL39).
The plasmid, pGLPR-1, which contains a 1.6 kb rat GLP-1
receptor cDNA insert cloned in the pCDNA-1 vector, was
cotransfected with the pWL-neo plasmid (Stratagene, La Jolla,
CA) into CHL cells. The pWL-neo plasmid contains the neomycin
resistance gene. Stable clones were selected in medium
containing 0.8 mg/ml G418. A stable transformant expressing an
estimated 70,000 rat GLP-1 receptors per cell was selected by
this scheme and further propagated in the presence of 80 µM
G418. Membranes from this transformant were subsequently used in
the high-volume-throughput-screening (HVTS) assay as described
in Example 3. Characterization of the receptor expressed by the
GLP-1 R/CHL cell line led to an estimated Kd of 0.8 nM for
whole cells and 2.3 nM for cell membranes, using
125I-GLP-1(7-36)amide as radioligand.
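
For orientation, the dissociation constants quoted above can be
translated into expected receptor occupancy. The sketch below assumes
simple one-site equilibrium binding, occupancy = [L] / ([L] + Kd), a
model the text does not state explicitly; the function name and the
chosen ligand concentrations are illustrative only.

```python
def fractional_occupancy(ligand_nm, kd_nm):
    """Fraction of receptors occupied at equilibrium (one-site model)."""
    return ligand_nm / (ligand_nm + kd_nm)


# Kd values and receptor number as reported for the GLP-1 R/CHL cell line.
KD_NM = {"whole cells": 0.8, "cell membranes": 2.3}
RECEPTORS_PER_CELL = 70_000

# Ligand concentrations below are arbitrary illustration points.
for preparation, kd in KD_NM.items():
    for ligand_nm in (0.1, 1.0, 10.0):
        occupancy = fractional_occupancy(ligand_nm, kd)
        print(f"{preparation}: {ligand_nm} nM ligand -> "
              f"{100 * occupancy:.1f} % occupied")

# At a ligand concentration equal to Kd, half of the receptors are occupied.
occupied_at_kd = RECEPTORS_PER_CELL * fractional_occupancy(0.8, KD_NM["whole cells"])
print(f"Receptors occupied per cell at 0.8 nM: {occupied_at_kd:.0f}")
```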

EXAMPLE 2

Molecular cloning of the human islet GLP-1 receptor cDNA.

Human islets were prepared as described (Ricordi et al.,
Diabetes 37 (1988), 413 - 420), and polyA+ RNA was isolated by
affinity chromatography by published methods (Gonda et al.,
Mol. Cell. Biol. 2 (1982) 617 - 624).

A human islet cDNA library was constructed in the λZAPII vector
from Stratagene (La Jolla, CA). Briefly, double stranded cDNA
was synthesized as previously described (Aruffo and Seed, 84
(1987), 8573 - 8577; Thorens, Proc. Natl. Acad. Sci., USA 89
(1992), 8641 - 8645), and EcoRI/NotI adaptors (Stratagene, La
Jolla, CA) were added with DNA ligase.

The resulting cDNA molecules were phosphorylated with
polynucleotide kinase before size fractionation on potassium
acetate gradients (Aruffo and Seed, 84 (1987), 8573 - 8577).
Double stranded cDNA with a size above 1.6 kb was ligated into
λZAPII arms (Stratagene, La Jolla, CA), packaged in λ phage and
grown on a lawn of XL-1 Blue E. coli cells as described in
protocols from Stratagene.

The cDNA library was screened by hybridization to a 32P labelled
DNA fragment from the rat GLP-1 receptor cDNA by previously
described methods (Maniatis et al., Molecular Cloning. A
Laboratory Manual. Cold Spring Harbor Laboratory, 1982).
The reduced stringency conditions used were: prehybridization
and hybridization in 30 % formamide, 5 × SSC, 5 × Denhardt, 50
mM phosphate buffer pH 6.8, 5 mM EDTA, 0.2 % SDS and 100 µg/ml
salmon sperm DNA at 42°C. Washings were 4 × 30 min in 2 × SSC,
0.2 % SDS at 42°C (Maniatis et al., Molecular Cloning. A
Laboratory Manual. Cold Spring Harbor Laboratory, 1982).
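
As a worked illustration of the reduced stringency hybridization
solution listed above, the sketch below converts the final
concentrations into volumes of stock solutions using C1 x V1 = C2 x V2.
The final concentrations come from the text; every stock concentration
(100 % formamide, 20 x SSC, 50 x Denhardt, 1 M phosphate buffer, 0.5 M
EDTA, 10 % SDS, 10 mg/ml salmon sperm DNA) and the function name are
assumptions chosen for the example, not values given in this
application.

```python
# name: (final concentration from the text, ASSUMED stock concentration),
# both expressed in the same unit within each pair.
RECIPE = {
    "formamide (%)": (30.0, 100.0),
    "SSC (x)": (5.0, 20.0),
    "Denhardt (x)": (5.0, 50.0),
    "phosphate buffer pH 6.8 (mM)": (50.0, 1000.0),
    "EDTA (mM)": (5.0, 500.0),
    "SDS (%)": (0.2, 10.0),
    "salmon sperm DNA (ug/ml)": (100.0, 10000.0),
}


def stock_volumes(total_ml, recipe=RECIPE):
    """Return the volume (ml) of each stock needed for `total_ml` of buffer.

    Water makes up the remainder; an error is raised if the stocks alone
    would exceed the requested total volume.
    """
    volumes = {name: total_ml * final / stock
               for name, (final, stock) in recipe.items()}
    water = total_ml - sum(volumes.values())
    if water < 0:
        raise ValueError("stock solutions are too dilute for this recipe")
    volumes["water"] = water
    return volumes


for component, volume_ml in stock_volumes(100.0).items():
    print(f"{component:32s} {volume_ml:6.2f} ml")
```

With these assumed stocks, 100 ml of solution needs 30 ml formamide and
25 ml of 20 x SSC, with water making up 26 ml of the total.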

Positive λ phages were purified by replating and hybridization,
and the cDNA inserts contained in the Bluescript vector present
in the λZAPII arms were excised using helper phages obtained
from Stratagene (La Jolla, CA). The inserts were partially
sequenced. One clone designated 3(20) showed high homology to
the rat GLP-1 receptor and was sequenced (Tabor and Richardson,
Proc. Natl. Acad. Sci., USA 84 (1987), 4767 - 4771) in its
entire length. The DNA sequence is shown as SEQ ID No. 3.

From homology analysis (Fig. 7), it was concluded that this
cDNA encoded the 3' part of the human GLP-1 receptor.
The deduced amino acid sequence of the human receptor has 92 %
identity to the rat GLP-1 receptor in the region from amino
acid number 170 to amino acid number 463 (numbers refer to the
rat sequence).
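
The identity figure is a simple count of matching positions over an
ungapped alignment. The sketch below illustrates the calculation on the
first 16 aligned residues only (rat residues 170 to 185 from SEQ ID
No. 2 and human residues 1 to 16 from SEQ ID No. 4, written in
one-letter code); it is not a recomputation of the 92 % value, which
refers to the whole region. The function name is an assumption.

```python
# First 16 aligned residues, transcribed from the sequence listing.
RAT_170_185 = "RHLHCTRNYIHLNLFA"   # SEQ ID No. 2, residues 170-185
HUMAN_1_16 = "RHLYCTRNYIHLNLFA"    # SEQ ID No. 4, residues 1-16


def percent_identity(seq_a, seq_b):
    """Percent of identical positions in two equal-length, ungapped sequences."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    matches = sum(a == b for a, b in zip(seq_a, seq_b))
    return 100.0 * matches / len(seq_a)


identity = percent_identity(RAT_170_185, HUMAN_1_16)
print(f"{identity:.2f} % identity over {len(RAT_170_185)} residues")
```

In this short window the two sequences differ at a single position
(His in rat, Tyr in human), so 15 of 16 residues match.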

The isolated human GLP-1 receptor cDNA does not contain the
entire open reading frame at the 5' end. However, a full length
clone can easily be obtained by methods well known to persons
skilled in the art. Among the alternative methods of choice, the
following examples should be mentioned: 1) The human islet cDNA
library can be rescreened with a probe from the 5' end of the
already cloned sequence. 2) Anchor-PCR or RACE (Rapid
Amplification of cDNA Ends) (Kriangkum et al., Nucleic Acids
Res. 20 (1992) 3793 - 3794; Troutt et al., Proc. Natl. Acad.
Sci., USA 89 (1992), 9823 - 9825) methodology can be used to
clone the remaining 5' sequences from islet RNA. 3) The
remaining 5' part can be isolated from human genomic libraries,
and DNA fragments considered to represent introns can be
identified based on homology to the cDNA of the rat receptor
and deleted by mutagenesis.

After cloning of the 5' end of the open reading frame, this
part of the cDNA can be fused to the remaining 3' part of the
human GLP-1 receptor cDNA by the use of PCR or through fusion
at appropriate restriction enzyme recognition sequences
identified in both the 5' and the 3' parts.

The cDNA encoding the full length open reading frame can be
cloned in suitable mammalian expression vectors and transfected
into mammalian cell lines for expression. Examples of such
suitable cell lines are the CHO and CHL cells, but other
mammalian cells will also express receptors of this type.

It has recently been demonstrated that insect cells (Vasudevan
et al. FEBS Lett. 311 (1992), 7 - 11) and microorganisms such as
yeast (King et al., Science 250 (1990), 121 - 123) can
express G-protein coupled receptors.

Recently frog skin melanophore cells have been used to express
G-protein coupled receptors (Potenza et al., Analytical
Biochem. 206 (1992), 315 - 322) and a functional coupling to
adenylate cyclase was demonstrated.

Other microorganisms such as Aspergillus, Bacillus and E. coli
might be able to express these receptors after appropriate
genetic engineering and selection.

It is therefore clear to persons skilled in the art that a
number of different expression systems can be designed that
will lead to expression of a functional receptor molecule.

As demonstrated in Example 3, the rat as well as the human
GLP-1 receptor can be used in screening assays for detection of
new potential agonist lead structures.

EXAMPLE 3 

High throughput screening assay for GLP-1 receptor agonists.

Screening of microbial extracts for secondary metabolites with
potential GLP-1 agonist activity was carried out using the SPA
(Scintillation Proximity Assay) technology (US patent 4568649;
Hart and Greenwalt, Mol. Immunol. 16 (1979) 265-267; Udenfriend
et al., Proc. Natl. Acad. Sci. USA 82 (1985) 8672-8676).
Wheatgerm agglutinin (WGA) coated SPA beads developed by
Amersham International were used (US patent 4568649, European
patent 0154734, Japanese patent appl. 84/52452). The WGA coat
allows GLP-1 receptor bearing membranes to be immobilized on
the SPA beads. Membranes used in the screening assay were
prepared from a CHL (ATCC CCL39) cell line expressing the
cloned rat GLP-1 receptor as described in Example 1. Membranes
were prepared essentially as described by Unden et al.
(Eur. J. Biochem. 145 (1984), 525-530). The binding of
125I-GLP-1(7-36)amide to such immobilized receptors brings the
tracer into close proximity to the scintillant present within
the SPA beads, resulting in the emission of light. Any unbound
ligand will not generate a signal. Thus, under assay conditions,
a microbial extract containing a component capable of binding
to the GLP-1 receptor and thereby displacing the tracer may be
identified by virtue of a reduction in signal intensity.

A high throughput assay was established using 96 well
microtiter plates. The assay was optimized with regard to the
amounts of WGA particles, membrane and tracer used. (The
125I-GLP-1(7-36)amide tracer was labelled using the
lactoperoxidase method (Morrison et al., Methods Enzymol. 70
(1980), 214-219) followed by purification on reverse phase
HPLC.) Using a Packard TopCount microplate scintillation
counter (Packard Instrument Company) these optimized conditions
resulted in a B0 of more than 7000 cpm. (Non specific binding
determined in the presence of 500 nM unlabelled
GLP-1(7-36)amide amounts to less than 1000 cpm. IC50 = 0.5-1.0
nM GLP-1(7-36)amide.)
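
The read-out described above reduces to a comparison of each well with
the total-binding (B0) and non-specific-binding (NSB) controls. The
following sketch shows one way such a calculation could be written; it
is an illustration and not the procedure actually used. The B0 and NSB
values are the approximate bounds given in the text, while the 50 %
cut-off, the well counts and all function names are hypothetical.

```python
def percent_specific_binding(sample_cpm, total_cpm, nonspecific_cpm):
    """Specific binding in a well as a percentage of the control window."""
    window = total_cpm - nonspecific_cpm
    if window <= 0:
        raise ValueError("total binding must exceed non-specific binding")
    return 100.0 * (sample_cpm - nonspecific_cpm) / window


def is_hit(sample_cpm, total_cpm, nonspecific_cpm, cutoff_percent):
    """A well is a hit when specific binding falls below the cut-off."""
    specific = percent_specific_binding(sample_cpm, total_cpm, nonspecific_cpm)
    return specific < cutoff_percent


B0_CPM = 7000.0          # total binding, order of magnitude from the text
NSB_CPM = 1000.0         # non-specific binding, order of magnitude from the text
CUTOFF_PERCENT = 50.0    # HYPOTHETICAL cut-off; the text does not state one

# Invented counts for two wells, for illustration only.
wells = {"extract A": 6800.0, "extract B": 3400.0}
for name, cpm in wells.items():
    specific = percent_specific_binding(cpm, B0_CPM, NSB_CPM)
    flag = "hit" if is_hit(cpm, B0_CPM, NSB_CPM, CUTOFF_PERCENT) else "no hit"
    print(f"{name}: {specific:.1f} % specific binding ({flag})")
```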

So far 1250 microbial extracts have been screened using the SPA
GLP-1 receptor assay. The extracts were tested at a final
dilution of 1:400. Under these conditions 15 out of the 1250
extracts resulted in a reduction of specific counts to below the
chosen cut-off level. These 15 hits have been further
characterized in a secondary assay. This secondary assay was
designed to test whether cAMP synthesis in a GLP-1 receptor
bearing cell line can be induced by components in the extract.
β-TC3 cells (Hanahan et al., Nature 315 (1985) 115-122; Efrat
et al., Proc. Natl. Acad. Sci. USA 85 (1988) 9037-9041) grown
in 96-well microtiter plates were exposed to extracts diluted
in culture media. After 20 min at 37°C the cells were lysed by
addition of acid and the cAMP concentration determined using
the cyclic AMP SPA system (Amersham International). Of the 15
primary hits tested in this secondary assay, 5 extracts have
been found to significantly increase the cAMP level in β-TC3
cells.
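
The screening numbers reported above correspond to the following hit
rates. The short calculation is added only as a convenience; the
variable names are illustrative.

```python
EXTRACTS_SCREENED = 1250   # primary SPA binding screen
PRIMARY_HITS = 15          # below the chosen cut-off in the binding assay
CONFIRMED_HITS = 5         # increased cAMP in the secondary assay

primary_hit_rate = 100.0 * PRIMARY_HITS / EXTRACTS_SCREENED
confirmation_rate = 100.0 * CONFIRMED_HITS / PRIMARY_HITS
overall_rate = 100.0 * CONFIRMED_HITS / EXTRACTS_SCREENED

print(f"Primary hit rate:  {primary_hit_rate:.1f} %")
print(f"Confirmation rate: {confirmation_rate:.1f} %")
print(f"Overall rate:      {overall_rate:.1f} %")
```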

It has thus been demonstrated that the screening approach
described in this patent application can feasibly result in the
isolation of natural compounds with GLP-1 agonist activity. The
use of such compounds as lead structures for a medicinal
chemistry approach will be of significant importance in the
design of novel GLP-1 agonists.

SEQUENCE LISTING



(1) GENERAL INFORMATION:

    (i) APPLICANT: Thorens, Bernard
    (ii) TITLE OF INVENTION: Novel Peptide
    (iii) NUMBER OF SEQUENCES: 4

    (iv) CORRESPONDENCE ADDRESS:
        (A) ADDRESSEE: NOVO NORDISK A/S, Patent Department
        (B) STREET: Novo Alle
        (C) CITY: Bagsvaerd
        (E) COUNTRY: Denmark
        (F) ZIP: DK-2880

    (v) COMPUTER READABLE FORM:
        (A) MEDIUM TYPE: Floppy disk
        (B) COMPUTER: IBM PC compatible
        (C) OPERATING SYSTEM: PC-DOS/MS-DOS
        (D) SOFTWARE: PatentIn Release #1.0, Version #1.25

    (vi) CURRENT APPLICATION DATA:
        (A) APPLICATION NUMBER:
        (B) FILING DATE:
        (C) CLASSIFICATION:

    (ix) TELECOMMUNICATION INFORMATION:
        (A) TELEPHONE: +45 44 44 88 88
        (B) TELEFAX: +45 44 49 32 56
        (C) TELEX: 37307

(2) INFORMATION FOR SEQ ID NO: 1:

    (i) SEQUENCE CHARACTERISTICS:
        (A) LENGTH: 3066 base pairs
        (B) TYPE: nucleic acid
        (C) STRANDEDNESS: single
        (D) TOPOLOGY: linear

    (ii) MOLECULE TYPE: cDNA

    (iii) HYPOTHETICAL: NO

    (vi) ORIGINAL SOURCE:
        (A) ORGANISM: Rat

    (ix) FEATURE:
        (A) NAME/KEY: CDS
        (B) LOCATION: 17..1408

    (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:



TCCTGAGCGC CCCGCC ATG GCC GTC ACC CCC AGC CTG CTG CGC CTG GCG      49
                  Met Ala Val Thr Pro Ser Leu Leu Arg Leu Ala
                  1               5                   10

CTC CTG CTG CTC GGG GCG GTG GGC AGG GCC GGC CCC CGC CCC CAG GGT    97
Leu Leu Leu Leu Gly Ala Val Gly Arg Ala Gly Pro Arg Pro Gln Gly
            15                  20                  25

GCC ACG GTG TCC CTC TCA GAG ACA GTG CAG AAA TGG AGA GAG TAT CGG   145
Ala Thr Val Ser Leu Ser Glu Thr Val Gln Lys Trp Arg Glu Tyr Arg
        30                  35                  40

CAC CAG TGC CAA CGT TTC CTC ACG GAA GCG CCA CTC CTG GCC ACA GGT   193
His Gln Cys Gln Arg Phe Leu Thr Glu Ala Pro Leu Leu Ala Thr Gly
    45                  50                  55

CTC TTC TGC AAC CGA ACC TTT GAT GAC TAC GCC TGC TGG CCA GAT GGG   241
Leu Phe Cys Asn Arg Thr Phe Asp Asp Tyr Ala Cys Trp Pro Asp Gly
60                  65                  70                  75

CCC CCA GGT TCC TTT GTG AAT GTC AGT TGC CCC TGG TAC CTG CCG TGG   289
Pro Pro Gly Ser Phe Val Asn Val Ser Cys Pro Trp Tyr Leu Pro Trp
                80                  85                  90

GCC AGT AGT GTG CTC CAA GGG CAT GTG TAC CGG TTC TGC ACG GCC GAG   337
Ala Ser Ser Val Leu Gln Gly His Val Tyr Arg Phe Cys Thr Ala Glu
            95                  100                 105

GGT ATC TGG CTG CAT AAG GAC AAC TCC AGC CTG CCC TGG AGG GAC CTG   385
Gly Ile Trp Leu His Lys Asp Asn Ser Ser Leu Pro Trp Arg Asp Leu
        110                 115                 120

TCG GAG TGC GAA GAG TCC AAG CAA GGA GAG AGA AAC TCC CCT GAG GAA   433
Ser Glu Cys Glu Glu Ser Lys Gln Gly Glu Arg Asn Ser Pro Glu Glu
    125                 130                 135

CAG CTC CTG TCG CTG TAC ATT ATC TAC ACG GTG GGG TAC GCA CTT TCT   481
Gln Leu Leu Ser Leu Tyr Ile Ile Tyr Thr Val Gly Tyr Ala Leu Ser
140                 145                 150                 155

TTC TCT GCC TTG GTC ATC GCT TCA GCC ATC CTT GTC AGC TTC AGA CAC   529
Phe Ser Ala Leu Val Ile Ala Ser Ala Ile Leu Val Ser Phe Arg His
                160                 165                 170

TTG CAC TGC ACC AGG AAC TAC ATC CAC CTG AAC CTG TTT GCG TCC TTC   577
Leu His Cys Thr Arg Asn Tyr Ile His Leu Asn Leu Phe Ala Ser Phe
            175                 180                 185

ATC CTC CGA GCA CTG TCC GTC TTC ATC AAA GAC GCT GCC CTC AAG TGG   625
Ile Leu Arg Ala Leu Ser Val Phe Ile Lys Asp Ala Ala Leu Lys Trp
        190                 195                 200

ATG TAT AGC ACG GCT GCG CAA CAG CAC CAG TGG GAT GGG CTC CTC TCG   673
Met Tyr Ser Thr Ala Ala Gln Gln His Gln Trp Asp Gly Leu Leu Ser
    205                 210                 215

TAT CAG GAC TCT CTG GGC TGC CGA CTG GTG TTC CTG CTC ATG CAA TAC   721
Tyr Gln Asp Ser Leu Gly Cys Arg Leu Val Phe Leu Leu Met Gln Tyr
220                 225                 230                 235

TGC GTG GCG GCC AAC TAC TAC TGG TTG CTG GTG GAA GGC GTG TAT CTG   769
Cys Val Ala Ala Asn Tyr Tyr Trp Leu Leu Val Glu Gly Val Tyr Leu
                240                 245                 250

TAC ACA CTG CTG GCC TTC TCG GTG TTC TCG GAG CAG GGC ATC TTC AAG   817
Tyr Thr Leu Leu Ala Phe Ser Val Phe Ser Glu Gln Arg Ile Phe Lys
            255                 260                 265

CTG TAC CTG AGC ATA GGC TGG GGA GTT CCG CTG CTG TTC GTT ATC CCC   865
Leu Tyr Leu Ser Ile Gly Trp Gly Val Pro Leu Leu Phe Val Ile Pro
        270                 275                 280

TGG GGC ATT GTC AAG TAT CTC TAC GAG GAC GAG GGT TGC TGG ACC AGG   913
Trp Gly Ile Val Lys Tyr Leu Tyr Glu Asp Glu Gly Cys Trp Thr Arg
    285                 290                 295

AAC TCC AAC ATG AAC TAT TGG CTC ATC ATA CGC TTG CCC ATT CTC TTT   961
Asn Ser Asn Met Asn Tyr Trp Leu Ile Ile Arg Leu Pro Ile Leu Phe
300                 305                 310                 315

GCA ATC GGG GTC AAC TTC CTT GTC TTC ATC CGG GTC ATC TGC ATC GTG  1009
Ala Ile Gly Val Asn Phe Leu Val Phe Ile Arg Val Ile Cys Ile Val
                320                 325                 330

ATA GCC AAG CTQ AAG GCT AAT CTC ATG TGT AAG ACC GAC ATC AAA TGC  1057
Ile Ala Lys Leu Lys Ala Asn Leu Met Cys Lys Thr Asp Ile Lys Cys
            335                 340                 345

AGA CTC GCG AAG TCC ACT CTG ACG CTC ATC CCG CTT CTG GGC ACG CAT  1105
Arg Leu Ala Lys Ser Thr Leu Thr Leu Ile Pro Leu Leu Gly Thr His
        350                 355                 360

GAA GTC ATC TTT GCC TTT GTG ATG GAC GAG CAC GCC CGA GGA ACC CTA  1153
Glu Val Ile Phe Ala Phe Val Met Asp Glu His Ala Arg Gly Thr Leu
    365                 370                 375

CGC TTC GTC AAG CTG TTC ACA GAG CTC TCC TTC ACT TCC TTC CAG GGC  1201
Arg Phe Val Lys Leu Phe Thr Glu Leu Ser Phe Thr Ser Phe Gln Gly
380                 385                 390                 395

TTT ATG GTG GCT GTC TTG TAC TGC TTT GTC AAC AAT GAG GTC CAG ATG  1249
Phe Met Val Ala Val Leu Tyr Cys Phe Val Asn Asn Glu Val Gln Met
                400                 405                 410

GAG TTT CGG AAG AGC TGG GAG CGC TGG AGG CTG GAG CGC TTG AAC ATC  1297
Glu Phe Arg Lys Ser Trp Glu Arg Trp Arg Leu Glu Arg Leu Asn Ile
            415                 420                 425

CAG AGG GAC AGC AGC ATG AAA CCC CTC AAG TGT CCC ACC AGC AGC GTC  1345
Gln Arg Asp Ser Ser Met Lys Pro Leu Lys Cys Pro Thr Ser Ser Val
        430                 435                 440

AGC AGT GGG GCC ACG GTG GGC AGC AGC GTG TAT GCA GCC ACC TGC CAA  1393
Ser Ser Gly Ala Thr Val Gly Ser Ser Val Tyr Ala Ala Thr Cys Gln
    445                 450                 455

AAT TCC TGC AGC TGAGCCCCAG TGCTGCGCTT CCTGATGGTC CTTGCTGCTG      1445
Asn Ser Cys Ser
460

GCTGGGTGGC CATCCCAGGT GGGAGAGACC CTGGGGACAG GGAATATGAG GGATACAGGC  1505
ACATGTGTGT GCGTGCCCGC ACACCACACA CACACACACA CACACACACA CACACACACA  1565
CACACACACA CACACGCTTT CCTCCTCAAA CCTATCAAAC AGGCATCGGC ATCGGCAGTG  1625
CCTCCTGGGA CCACAGACAC ATGTTCTCCA AGGAGAACAG CCTGCTAATT TAATCTCAGG  1685
CGACAGGAAG AGAGGAAGAA ACAATTGCTG TTAAGACGAG GAGGACTTCT TCCTGTTAAA  1745
GCTGCAAGGC CCTTGGGGTT CCCTCGGACA GAACTGCAAA TCAACCCCGG AACTCTCGCT  1805
CAAGGGCAAT TGCTGACGGG TGGAACTTGG GCTTGCGAGA GGAGGCAGGT CCATGAGAGA  1865
CCTGCCCTTG GAACCTCAGC CAGCACAGCG CTCATCAAGG TGAGCTGGCT GTGCTGTGTG  1925
CACGGCTGGG GTTGTCACCT ACATCAGCCT TCCTCTCGGA CAAGAGGCTT TTCTCTGTGC  1985
ATCTGGAGTG CCGCCATTCC TCCATCTGCC CGTTCATCCG CCATCCTGTC TTTGCCTTGG  2045
GGAGGGGGAG GTTTGTTGAA GTCATGCCGT GCAGCTCTTT CTGGAAATAT CTGTGGATGG  2105
TGTTGAAGAT AAGCATGGGG GAGATACAAC AGAGGCAGTC TTTGCCCATG GCCACTTCTT  2165
GCCTGGTCCT TTAAGCCACT TTGCTGCTTG GTTTCTGCCC TGCATGGGTA CTACTAGGGC  2225
AGGTCCCAAG TTGAGAAGCC CAGAGGTGAG GTGTGAACCC TCAGTTCTGT TGTAAAGATG  2285
CTCAAATACC CTCTAAGGTT CATCTAAAGG AGTAACCTGC CTAGGGGTGC TGTTGACCTG  2345
AAATCAAGAG GACCAAAGGA TCCATTGCCA ACACCCCCCA TCCCCCACAC ACACCTCATC  2405
TGTGACCAGA GTCTATGCTT TGAATCAGAA TGGGCTATAT CCTCTGACCT CAGAGGCTAT  2465
GACCCAGAAG AGATTCTTCC CTGAATCCTC CCACTTTGCA CACATATAGA CTTTATCCTT  2525
CTTCACTCTG TGTCTATTCA AACGTATAAT TCTGGTTTCT CTCACCCCAC GGAAGAACTA  2585
GATCACAGCA ACTGTTATGT TTGAGGGAGT GGGGGAGAAG GTGATTGATT TGACCCCCTC  2645
TCCCCCACCG GTGTTGATAA GTAGCGTCTG TCCCACCTCC AGACTCCACC CACACATAAT  2705
GAGCAGCACA TAGACCAGGA TGGGGGGGGT GGTATATCAT GCTTGCCCTC CTCCAACCAC  2765
TATGAGAAGG CTAGCAGAAG ACACCACTGC ACAGACCCAA GTCCAAGGAC TGCCTCCCAG  2825
GGAATTAGGC AGTGACTTCC TAGAGGCCAA GAAAGACTCC AAGAGCTGGA GAAGAATCCT  2885
AGTCGATCTG GATCTCTTTT GAGGTTGGGG TTGGGGTGGC TTTCAATGGA TTCTCTCATG  2945
AGGCTTATCT CTCCCTCATC CCGTGGAGAG TGGGGGACCC TCCCTAGTGC TCACACTAGA  3005
CACTGTGCCC CTTGGAGAGG CATAAGGCAT GTATGGGAGA TAATAATGGG CTATAAAACA  3065
T                                                                  3066
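
As a consistency check on the feature tables, the CDS coordinates in the
sequence listing can be converted into codon and residue counts. The
sketch below is illustrative only; it assumes that each CDS range is
inclusive and ends with a stop codon, and the function name is
hypothetical.

```python
def cds_summary(start, end):
    """Return (length_nt, codons, residues) for an inclusive CDS range.

    Assumes the final codon of the range is a stop codon, so the encoded
    polypeptide is one residue shorter than the codon count.
    """
    length_nt = end - start + 1
    if length_nt % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    codons = length_nt // 3
    return length_nt, codons, codons - 1


# CDS locations as given in the listing for SEQ ID NO: 1 and SEQ ID NO: 3.
for label, (start, end) in {"SEQ ID NO: 1": (17, 1408),
                            "SEQ ID NO: 3": (3, 887)}.items():
    length_nt, codons, residues = cds_summary(start, end)
    print(f"{label}: {length_nt} nt, {codons} codons, {residues} residues")
```

The resulting residue counts, 463 and 294, agree with the lengths stated
for SEQ ID NO: 2 and SEQ ID NO: 4.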

(2) INFORMATION FOR SEQ ID NO: 2:

    (i) SEQUENCE CHARACTERISTICS:
        (A) LENGTH: 463 amino acids
        (B) TYPE: amino acid
        (D) TOPOLOGY: linear

    (ii) MOLECULE TYPE: protein

    (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:

Met Ala Val Thr Pro Ser Leu Leu Arg Leu Ala Leu Leu Leu Leu Gly
1               5                   10                  15

Ala Val Gly Arg Ala Gly Pro Arg Pro Gln Gly Ala Thr Val Ser Leu
            20                  25                  30

Ser Glu Thr Val Gln Lys Trp Arg Glu Tyr Arg His Gln Cys Gln Arg
        35                  40                  45

Phe Leu Thr Glu Ala Pro Leu Leu Ala Thr Gly Leu Phe Cys Asn Arg
    50                  55                  60

Thr Phe Asp Asp Tyr Ala Cys Trp Pro Asp Gly Pro Pro Gly Ser Phe
65                  70                  75                  80

Val Asn Val Ser Cys Pro Trp Tyr Leu Pro Trp Ala Ser Ser Val Leu
                85                  90                  95

Gln Gly His Val Tyr Arg Phe Cys Thr Ala Glu Gly Ile Trp Leu His
            100                 105                 110

Lys Asp Asn Ser Ser Leu Pro Trp Arg Asp Leu Ser Glu Cys Glu Glu
        115                 120                 125

Ser Lys Gln Gly Glu Arg Asn Ser Pro Glu Glu Gln Leu Leu Ser Leu
    130                 135                 140

Tyr Ile Ile Tyr Thr Val Gly Tyr Ala Leu Ser Phe Ser Ala Leu Val
145                 150                 155                 160

Ile Ala Ser Ala Ile Leu Val Ser Phe Arg His Leu His Cys Thr Arg
                165                 170                 175

Asn Tyr Ile His Leu Asn Leu Phe Ala Ser Phe Ile Leu Arg Ala Leu
            180                 185                 190

Ser Val Phe Ile Lys Asp Ala Ala Leu Lys Trp Met Tyr Ser Thr Ala
        195                 200                 205

Ala Gln Gln His Gln Trp Asp Gly Leu Leu Ser Tyr Gln Asp Ser Leu
    210                 215                 220

Gly Cys Arg Leu Val Phe Leu Leu Met Gln Tyr Cys Val Ala Ala Asn
225                 230                 235                 240

Tyr Tyr Trp Leu Leu Val Glu Gly Val Tyr Leu Tyr Thr Leu Leu Ala
                245                 250                 255

Phe Ser Val Phe Ser Glu Gln Arg Ile Phe Lys Leu Tyr Leu Ser Ile
            260                 265                 270

Gly Trp Gly Val Pro Leu Leu Phe Val Ile Pro Trp Gly Ile Val Lys
        275                 280                 285

Tyr Leu Tyr Glu Asp Glu Gly Cys Trp Thr Arg Asn Ser Asn Met Asn
    290                 295                 300

Tyr Trp Leu Ile Ile Arg Leu Pro Ile Leu Phe Ala Ile Gly Val Asn
305                 310                 315                 320

Phe Leu Val Phe Ile Arg Val Ile Cys Ile Val Ile Ala Lys Leu Lys
                325                 330                 335

Ala Asn Leu Met Cys Lys Thr Asp Ile Lys Cys Arg Leu Ala Lys Ser
            340                 345                 350

Thr Leu Thr Leu Ile Pro Leu Leu Gly Thr His Glu Val Ile Phe Ala
        355                 360                 365

Phe Val Met Asp Glu His Ala Arg Gly Thr Leu Arg Phe Val Lys Leu
    370                 375                 380

Phe Thr Glu Leu Ser Phe Thr Ser Phe Gln Gly Phe Met Val Ala Val
385                 390                 395                 400

Leu Tyr Cys Phe Val Asn Asn Glu Val Gln Met Glu Phe Arg Lys Ser
                405                 410                 415

Trp Glu Arg Trp Arg Leu Glu Arg Leu Asn Ile Gln Arg Asp Ser Ser
            420                 425                 430

Met Lys Pro Leu Lys Cys Pro Thr Ser Ser Val Ser Ser Gly Ala Thr
        435                 440                 445

Val Gly Ser Ser Val Tyr Ala Ala Thr Cys Gln Asn Ser Cys Ser
    450                 455                 460



(2) INFORMATION FOR SEQ ID NO: 3:

    (i) SEQUENCE CHARACTERISTICS:
        (A) LENGTH: 1909 base pairs
        (B) TYPE: nucleic acid
        (C) STRANDEDNESS: single
        (D) TOPOLOGY: linear

    (ii) MOLECULE TYPE: cDNA

    (iii) HYPOTHETICAL: NO

    (vi) ORIGINAL SOURCE:
        (A) ORGANISM: Homo sapiens

    (ix) FEATURE:
        (A) NAME/KEY: CDS
        (B) LOCATION: 3..887

    (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:

TC AGA CAC CTG TAC TGC ACC AGG AAC TAC ATC CAC CTG AAC CTG TTT     47
   Arg His Leu Tyr Cys Thr Arg Asn Tyr Ile His Leu Asn Leu Phe
   1               5                   10                  15

GCA TCC TTC ATC CTG CGA GCA TTG TCC GTC TTC ATC AAG GAC GCA GCC    95
Ala Ser Phe Ile Leu Arg Ala Leu Ser Val Phe Ile Lys Asp Ala Ala
                20                  25                  30

CTG AAG TGG ATG TAT AGC ACA GCC GCC CAG CAG CAC CAG TGG GAT GGG   143
Leu Lys Trp Met Tyr Ser Thr Ala Ala Gln Gln His Gln Trp Asp Gly
            35                  40                  45

CTG GTC TGC TAC CAG GAC TCT CTG AGC TGC CGC CTG GTG TTT CTG CTC   191
Leu Leu Ser Tyr Gln Asp Ser Leu Ser Cys Arg Leu Val Phe Leu Leu
        50                  55                  60

ATG CAG TAC TGT GTG GCG GCC AAT TAC TAC TGG CTC TTG GTG GAG GGC   239
Met Gln Tyr Cys Val Ala Ala Asn Tyr Tyr Trp Leu Leu Val Glu Gly
    65                  70                  75

GTG TAC CTG TAC ACA CTG CTG GCC TTC TCG GTG TTC TCT GAG CAA TGG   287
Val Tyr Leu Tyr Thr Leu Leu Ala Phe Ser Val Phe Ser Glu Gln Trp
80                  85                  90                  95

ATC TTC AGG CTC TAC GTG AGC ATA GGC TGG GGT GTT CCC CTG CTG TTT   335
Ile Phe Arg Leu Tyr Val Ser Ile Gly Trp Gly Val Pro Leu Leu Phe
                100                 105                 110

GTT GTC CCC TGG GGC ATT GTC AAG ATC CTC TAT GAG GAC GAG GGC TGC   383
Val Val Pro Trp Gly Ile Val Lys Ile Leu Tyr Glu Asp Glu Gly Cys
            115                 120                 125

TGG ACC AGG AAC TCC AAC ATG AAC TAC TGG ac ATT ATC GGG CTG CCC    431
Trp Thr Arg Asn Ser Asn Met Asn Tyr Trp Leu Ile Ile Arg Leu Pro
        130                 135                 140

ATT CTC TTT GCC ATT GGG GTG AAC TTC CTC ATC TTT GTT CGG GTC ATC   479
Ile Leu Phe Ala Ile Gly Val Asn Phe Leu Ile Phe Val Arg Val Ile
    145                 150                 155

TGC ATC GTG GTA TCC AAA CTG AAG GCC AAT GTC ATG TGC AAG ACA GAG   527
Cys Ile Val Val Ser Lys Leu Lys Ala Asn Val Met Cys Lys Thr Asp
160                 165                 170                 175

ATC AAA TGC AGA CTT GCC AAG TCC ACG CTG ACA CTC ATC CCC CTG CTG   575
Ile Lys Cys Arg Leu Ala Lys Ser Thr Leu Thr Leu Ile Pro Leu Leu
                180                 185                 190

GGG ACT CAT GAG GTC ATC TTT GCC TTT GTG ATG GAC GAG CAC GCC CGG   623
Gly Thr His Glu Val Ile Phe Ala Phe Val Met Asp Glu His Ala Arg
            195                 200                 205

GGG ACC CTG CGC TTC ATC AAG CTG TTT ACA GAG CTC TCC TTC ACC TCC   671
Gly Thr Leu Arg Phe Ile Lys Leu Phe Thr Glu Leu Ser Phe Thr Ser
        210                 215                 220

TTC CAG GGG CTG ATG GTG GCC ATC TTA TAC TGC TTT GTC AAC AAT GAG   719
Phe Gln Gly Leu Met Val Ala Ile Leu Tyr Cys Phe Val Asn Asn Glu
    225                 230                 235

GTC CAG CTG GAA TTT CGG AAG AGC TGG GAG CGC TGG CGG CTT GAG CAC   767
Val Gln Leu Glu Phe Arg Lys Ser Trp Glu Arg Trp Arg Leu Glu His
240                 245                 250                 255

TTG CAC ATC CAG AGG GAC AGC AGC ATG AAG CCC CTC AAG TGT CCC ACC   815
Leu His Ile Gln Arg Asp Ser Ser Met Lys Pro Leu Lys Cys Pro Thr
                260                 265                 270

AGC AGC CTG AGC AGT GGA GCC ACG GCG GGC AGC AGC ATG TAC ACA GCC   863
Ser Ser Leu Ser Ser Gly Ala Thr Ala Gly Ser Ser Met Tyr Thr Ala
            275                 280                 285

ACT TGC CAG GCC TCC TGC AGC TGAGACTCCA GCGCCTGCCC TCCCTGGGGT      914
Thr Cys Gln Ala Ser Cys Ser
        290                 295

CCTTGCTGCG GCCGGGTGGC AATCCAGGAG AAGCAGCCTC CTAATTTGAT CACAGTGGCG   974
AGAGGAGAGG AAAAACGATC GCTGTGAAAA TGAGGAGGAT TGCTTCTTGT GAAACCACAG  1034
GCCCTTGGGG TTCCCCCAGA CAGAGCCGCA AATCAACCCC AGACTCAAAC TCAAGGTCAA  1094
CGGCTTATTA GTGAAACTGG GGCTTGCAAG AGGAGGTGGT TCTGAAAGTG GCTCTTCTAA  1154
CCTCAGCCAA ACACGAGCGG GAGTGACGGG AGCCTCCTCT GCTTGCATCA CTTGGGGTCA  1214
CCACCCTCCC CTGTCTTCTC TCAAAGGGAA GCTGTTTGTG TGTCTGGGTT GCTTATTTCC  1274
CTCATCTTGC CCCCTCATCT CACTGCCCAG TTTCTTTTTG AGGGCTTGTT GGCCACTGCC  1334
AGCAGCTGTT TCTGGAAATG GCTGTAGGTG GTGTTGAGAA AGAATGAGCA TTGAGACACG  1394
GTGCTCGCTT CTCCTCCAGG TATTTGAGTT GTTTTGGTGC CTGCCTCTGC CATGCCCAGA  1454
GAATCAGGGC AGGCTTGCCA CCGGGGAACC CAGCCCTGGG GTATGAGCTG CCAAGTCTAT  1514
TTTAAAGACG CTCAAGAATC CTCTGGGGTT CATCTAGGGA CACGTTAGGA ATGTCCAGAC  1574
TGTGGGTGTA GGTTACCTGC CACTTCCAGG ACGCAGAGGG CCAAGAGAGA CATTGCCTCC  1634
ACCTCTCCTG AATACTTATC TGTGACCACA CGCTGTCTCT TGAGATTTGG ATACACTCTC  1694
TAGCTTTAGG GGACCATGAA GAGACTCTCT TAGGAAACCA ATAGTCCCCA TCAGCACCAT  1754
GGAGGCAGGC TCCCCCTGCC TTTGAAATTC CCCCACTTGG GAGCTGATAT ACTTCACTCA  1814
CTTTTCTTTA TTGCTGTGAT AGTCTGTGTG CACAATGGGC AATTCTGACT TCTCCCATCT  1874
AGTGAAATGA GCGAAATCAT GGTTGTAGTG ATCTT                             1909



(2) INFORMATION FOR SEQ ID NO: 4:

    (i) SEQUENCE CHARACTERISTICS:
        (A) LENGTH: 294 amino acids
        (B) TYPE: amino acid
        (D) TOPOLOGY: linear

    (ii) MOLECULE TYPE: protein

    (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:

Arg His Leu Tyr Cys Thr Arg Asn Tyr Ile His Leu Asn Leu Phe Ala
1               5                   10                  15

Ser Phe Ile Leu Arg Ala Leu Ser Val Phe Ile Lys Asp Ala Ala Leu
            20                  25                  30

Lys Trp Met Tyr Ser Thr Ala Ala Gln Gln His Gln Trp Asp Gly Leu
        35                  40                  45

Leu Ser Tyr Gln Asp Ser Leu Ser Cys Arg Leu Val Phe Leu Leu Met
    50                  55                  60

Gln Tyr Cys Val Ala Ala Asn Tyr Tyr Trp Leu Leu Val Glu Gly Val
65                  70                  75                  80

Tyr Leu Tyr Thr Leu Leu Ala Phe Ser Val Phe Ser Glu Gln Trp Ile
                85                  90                  95

Phe Arg Leu Tyr Val Ser Ile Gly Trp Gly Val Pro Leu Leu Phe Val
            100                 105                 110

Val Pro Trp Gly Ile Val Lys Ile Leu Tyr Glu Asp Glu Gly Cys Trp
        115                 120                 125

Thr Arg Asn Ser Asn Met Asn Tyr Trp Leu Ile Ile Arg Leu Pro Ile
    130                 135                 140

Leu Phe Ala Ile Gly Val Asn Phe Leu Ile Phe Val Arg Val Ile Cys
145                 150                 155                 160

Ile Val Val Ser Lys Leu Lys Ala Asn Val Met Cys Lys Thr Asp Ile
                165                 170                 175

Lys Cys Arg Leu Ala Lys Ser Thr Leu Thr Leu Ile Pro Leu Leu Gly
            180                 185                 190

Thr His Glu Val Ile Phe Ala Phe Val Met Asp Glu His Ala Arg Gly
        195                 200                 205

Thr Leu Arg Phe Ile Lys Leu Phe Thr Glu Leu Ser Phe Thr Ser Phe
    210                 215                 220

Gln Gly Leu Met Val Ala Ile Leu Tyr Cys Phe Val Asn Asn Glu Val
225                 230                 235                 240

Gln Leu Glu Phe Arg Lys Ser Trp Glu Arg Trp Arg Leu Glu His Leu
                245                 250                 255

His Ile Gln Arg Asp Ser Ser Met Lys Pro Leu Lys Cys Pro Thr Ser
            260                 265                 270

Ser Leu Ser Ser Gly Ala Thr Ala Gly Ser Ser Met Tyr Thr Ala Thr
        275                 280                 285

Cys Gln Ala Ser Cys Ser
    290

CLAIMS

1. A recombinant glucagon-like peptide-1 (GLP-1) receptor.

2. A GLP-1 receptor according to claim 1 of mammalian
origin.

3. A GLP-1 receptor according to claim 2 of rat or human
origin.

4. A GLP-1 receptor according to claim 3, which comprises
the amino acid sequence shown in SEQ ID No. 1, or an analogue
thereof binding GLP-1 with an affinity constant below 100 nM,
preferably below 10 nM.

5. A GLP-1 receptor according to claim 3, which comprises
the partial amino acid sequence shown in SEQ ID No. 3, or an
analogue thereof binding GLP-1 with an affinity constant below
100 nM, preferably below 10 nM.

6. A GLP-1 receptor according to any of the claims 1 to 5,
which is in a solubilised or reconstituted form.

7. A DNA construct which comprises a DNA sequence encoding
a GLP-1 receptor according to any of the claims 1 to 6.

8. A DNA construct according to claim 7, which comprises the
DNA sequence shown in SEQ ID No. 1, or a DNA sequence coding
for a functional analogue thereof binding GLP-1 with an
affinity constant below 100 nM, preferably below 10 nM.

9. A DNA construct according to claim 7, which comprises the
partial DNA sequence shown in SEQ ID No. 3, or a DNA sequence
coding for a functional analogue thereof binding GLP-1 with an
affinity constant below 100 nM, preferably below 10 nM.




10. A recombinant: expression vector which carries an inserted 
DNA construct according to any o£ claims 7 to 9* 

11. A cell containing a recombinant expression vector 
according to claim 10. 

5 12. A cell containing a DNA construct according to any of 
claims 7 to 9 integrated in its genome. 

13. A cell according to claim 11 or 12, which is an 
eukaryotic cell, in particular an insect or a mammalian cell. 

14. A method of screening for agonists or enhancers of GLP-l 
10 activity^ the method comprising incubating a GLP-*1 receptor 

according to any of claims 1 to 6 with a substance suspected to 
be an agonist of GLP-1 activity and subsequently with a GLP-1 
or an analogue thereof, and detecting any effect of binding of 
GLP-1 or the analogue to the GLP-1 receptor. 

15 15. A method of screening for agonists or enhancers of GLP-1 
activity, the method comprising incubating GLP-1 or an analogue 
thereof with a substance suspected to be an agonist of GLP-1 
activity and subsequently with a GLP-1 receptor of the 
invention, and detecting any effect of binding of GLP-1 or the 
analogue to the receptor. 

16. Use of a GLP-1 receptor according to any of claims 1 to 
6 for screening for agonists of GLP-1 activity. 

17. Use of DNA constructs according to claims 7 to 9 for 
isolation of tissue and/or organ specific variants of the GLP-1 
receptor. 

18. Use of a receptor isolated according to claim 17 for the 
screening of GLP-1 agonists. 
GLPR MAVTP SLLR LALLLLGAVGRAGPRPQGA 28 

SECR MLSTMRPR-LSLLL LRLLLLTKAAHTV GV 28 

PTHR MGAPRISHSLALLLCCSVLSSVYALVDADDVITKEEQIILLRNAQAQCEQEL 52 

CTR1 MRFTLTRWCLTLFIFLNRPLPVLPDSADGAHTPTLEPEPFLY 42 



GLPR TVSLSETVQKWREYRHQCQRFLTE APLLATGLF 61 

SECR PPRLCDVRRVLLEERAHCLQQLSKEK KGALGPETASG-- 65 

PTHR KEVLRVPELAESAKDWMSRSAKTKKEKPAEKLYPQAEESREVSDRSRLQDGF 104 

CTR1 --ILGKQRM LEAQHRC Y DRMQKLPPYQGEGLY 72 







GLPR CNRTFDDYACWPDGPPGSFVNVSCPWYLPWASSVLQGHVYRFCT 105 

SECR CEGLWDNMSCWPSSAPARTVEVRCPKSLLSLSNK-NGSLFRNCT 108 

PTHR CLPEWDNIVCWPAGVPGKVVAVPCPDYFYDFNHK--GRAYRRCD 146 

CTR1 CNRTWDGWSCWDDTPAGVLAEQYCPDYFPDFDA--AEKVTKYCG 114 



I 

GLPR AEGIWLHKDNSSLPWRDLSECEESKQGERNSPEEQLLSLYIIYTVGYALSFS 157 

SECR QDG-W SETFPRPDLACGVNINNSFNERRHAYLLKLKVMYTVGYSSSLA 155 

PTHR SNGSWELVPGNNRTWANYSECVKFLTNETREREV-FDRLGMIYTVGYSISLG 197 

CTR1 EDGDWYRHPESNISWSNYTMCNAFTP--DKLQNAYI--LYYLAIVGHSLSIL 162 



II 

GLPR ALVIASAILVSFRHLHCTRNYIHLNLFASFILRALSVFIKDAALKWMYSTAA 209 

SECR MLLVALSILCSFRRLHCTRNYIHMHLFVSFILRALSNFIKDAVL---FSSDD 204 

PTHR SLTVAVLILGYFRRLHCTRNYIHMHLFVSFMLRAVSIFIKDAVLYSGVSTDE 249 

CTR1 TLLISLGIFMFLRSISCQRVTLHKNMFLTYVLNSIIIIVHLVVI 206 



III 



GLPR QQHQWDG-LLSY-QDS LGCRLVFLLMQYCVAANYYWLLVEGVYLY 252 

SECR VTYCDAHK VGCKLVMIFFQYCIMANYAWLLVEGLYLH 241 

PTHR IERITEEELRAFTEPPPADKAGFVGCRVAVTVFLYFLTTNYYWILVEGLYLH 301 

CTR1 -- VPNGELVK-RDPPI CKVLHFFHQYMMSCNYFWMLCEGVYLH 246 



Fig. 1A 



SUBSTITUTE SHEET 
IV 



GLPR TLLAFSVFSEQRIFKLYLSIGWGVPLLFVIPWGIVKYLYEDEGCWTRNSNMN 

SECR TLLAISFFSERKYLQAFVLLGWGSPAIFVALWAITRHFLENTGCWDINANAS 

PTHR SLIFMAFFSEKKYLWGFTLFGWGLPAVFVAVWVTVRATLANTECWDLSSGNK 

CTR1 TLIVVSVFAEGQRLWWYHVLGWGFPLIPTTAHAITRANLFNDNCW-LSVDTN 
.*. .*.*.. . * . . ... ...** . . . 



GLPR YWLIIRLPILFAIGVNFLVFIRVICIVIAKLKANLMCKTDIKC---RLAKST 

SECR VWWVIRGPVILSILINFIFFINILRILMRKLRTQETRGSETNH-YKRLAKST 

PTHR KW-IIQVPILAAIVVNFILFINIIRVLATKLRETNAGRCDTRQQYRKLLKST 

CTR1 LLYIIHGPVMAALVVNFFFLLNILRVLVKKLKESQE---AESHMYLKAVRAT 



VI VII 

GLPR LTLIPLLGTHEVIFAFVMDEHARGTLRFVKLFTELSFTSFQGFMVAVLYCFV 

SECR LLLIPLFGIHYIVFAFSHEDAME VQLFFELALGSFQGLVVAVLYCFL 

PTHR LVLMPLFGVHYIVFMATPYTEVSGILWQVQMHYEMLFNSFQGFFVAIIYCFC 

CTR1 LILVPLLGVQFVVLPWRPSTPLLGKIYD YVVHSLIHFQGFFVAIIYCFC 

**.**.*.,,., . . ***. 



GLPR NNEVQMEFRKSWERWRLE-RLNIQRDSSMKPLKC 

SECR NGEVQLEVQKKWRQWHLQ-EFPLRPVAFNNSFSN 

PTHR NGEVQAEIKKSWSRWTLALDFKRKARSGSSTYSYGPMVSHTSVTNVGPRGGL 

CTR1 NHEVQGALKRQWNQ YQAQRWAGRRS TRAANAAAATAAAAAAL 



GLPR PTSSVSSGATV 

SECR ATNGPTHSTKA 

PTHR ALSLSPRLAPGAGASANGHHQLPGYVKHGSISENSLPSSGPEPGTKDDGYLN 

CTR1 AETV EIPVYICHQEPREEP---AGEEPVVEVEG-- 



GLPR GSSVYAATC QNSCS 463 

SECR STEQSRSIP RASH 449 

PTHR GSGLYEPMVGEQPPPLLEEERETVM 585 

CTR1 VEVIAMEVLEQE--TSA 482 



Fig. 1B 
GLP-1 (nM) 
FIG. 3 

Peptide concentration (M) 
FIG. 

GLP-1 (nM) 
FIG. 5 

FIG. 6 
RAT - MAVTPSLLRLALLLLGAVGRAGPRPQGATVSLSETVQKWREYRHQCQRFL - 50 

RAT - TEAPLLATGLFCNRTFDDYACWPDGPPGSFVNVSCPWYLPWASSVLQGHV -100 

RAT - YRFCTAEGIWLHKDNSSLPWRDLSECEESKQGERNSPEEQLLSLYIIYTV -150 

RAT - GYALSFSALVIASAILVSFRHLHCTRNYIHLNLFASFILRALSVFIKDAA -200 
HUM -                    RHLYCTRNYIHLNLFASFILRALSVFIKDAA - 31 

RAT - LKWMYSTAAQQHQWDGLLSYQDSLGCRLVFLLMQYCVAANYYWLLVEGVY -250 
HUM - LKWMYSTAAQQHQWDGLLSYQDSLSCRLVFLLMQYCVAANYYWLLVEGVY - 81 

RAT - LYTLLAFSVFSEQRIFKLYLSIGWGVPLLFVIPWGIVKYLYEDEGCWTRN -300 
HUM - LYTLLAFSVFSEQWIFRLYVSIGWGVPLLFVVPWGIVKILYEDEGCWTRN -131 

RAT - SNMNYWLIIRLPILFAIGVNFLVFIRVICIVIAKLKANLMCKTDIKCRLA -350 
HUM - SNMNYWLIIRLPILFAIGVNFLIFVRVICIVVSKLKANLMCKTDIKCRLA -181 

RAT - KSTLTLIPLLGTHEVIFAFVMDEHARGTLRFVKLFTELSFTSFQGFMVAV -400 
HUM - KSTLTLIPLLGTHEVIFAFVMDEHARGTLRFIKLFTELSFTSFQGLMVAI -231 

RAT - LYCFVNNEVQMEFRKSWERWRLERLNIQRDSSMKPLKCPTSSVSSGATVG -450 
HUM - LYCFVNNEVQLEFRKSWERWRLEHLHIQRDSSMKPLKCPTSSLSSGATAG -281 

RAT - SSVYAATCQNSCS -463 
HUM - SSMYTATCQASCS -294 
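
The rat/human comparison above can be tallied position by position. The following is a minimal sketch, assuming the residues transcribed from the figure are correct; the function name `identity` and the variable names are hypothetical, and only the first 81 aligned positions (rat 170 to 250, human 1 to 81) are used to keep the example short.

```python
def identity(a, b):
    """Return (matches, compared) for two gap-free aligned sequences of equal length."""
    if len(a) != len(b):
        raise ValueError("aligned sequences must have equal length")
    matches = sum(x == y for x, y in zip(a, b))
    return matches, len(a)


# Rat residues 170-250 and human residues 1-81, transcribed from the alignment
# above (assumption: the transcription is free of reading errors).
RAT_170_250 = (
    "RHLHCTRNYIHLNLFASFILRALSVFIKDAA"
    "LKWMYSTAAQQHQWDGLLSYQDSLGCRLVFLLMQYCVAANYYWLLVEGVY"
)
HUM_1_81 = (
    "RHLYCTRNYIHLNLFASFILRALSVFIKDAA"
    "LKWMYSTAAQQHQWDGLLSYQDSLSCRLVFLLMQYCVAANYYWLLVEGVY"
)

matches, compared = identity(RAT_170_250, HUM_1_81)

# List each difference as (rat position, rat residue, human residue).
mismatches = [
    (i + 170, r, h)
    for i, (r, h) in enumerate(zip(RAT_170_250, HUM_1_81))
    if r != h
]

print(f"{matches}/{compared} identical ({100 * matches / compared:.1f}%)")
for pos, r, h in mismatches:
    print(f"rat {pos}: {r} -> human {h}")
```

Over these 81 positions the sequences differ at two residues (rat His 173 and Gly 225, which are Tyr and Ser in the human fragment). Extending the two strings with the remaining aligned rows would give the figure for the whole 294-residue overlap.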